<?xml version="1.0" encoding="ISO-8859-1"?>
<metadatalist>
	<metadata ReferenceType="Conference Proceedings">
		<site>sibgrapi.sid.inpe.br 802</site>
		<holdercode>{ibi 8JMKD3MGPEW34M/46T9EHH}</holdercode>
		<identifier>8JMKD3MGPEW34M/45CU66B</identifier>
		<repository>sid.inpe.br/sibgrapi/2021/09.06.18.15</repository>
		<lastupdate>2021:09.06.18.15.01 sid.inpe.br/banon/2001/03.30.15.38 administrator</lastupdate>
		<metadatarepository>sid.inpe.br/sibgrapi/2021/09.06.18.15.01</metadatarepository>
		<metadatalastupdate>2022:09.10.00.16.17 sid.inpe.br/banon/2001/03.30.15.38 administrator {D 2021}</metadatalastupdate>
		<citationkey>MaiaVieiPedr:2021:ViRhCo</citationkey>
		<title>Visual rhythm-based convolutional neural networks and adaptive fusion for a multi-stream architecture applied to human action recognition</title>
		<format>On-line</format>
		<year>2021</year>
		<numberoffiles>1</numberoffiles>
		<size>939 KiB</size>
		<author>Maia, Helena de Almeida,</author>
		<author>Vieira, Marcelo Bernardes,</author>
		<author>Pedrini, Helio,</author>
		<affiliation>UNICAMP</affiliation>
		<affiliation>UFJF</affiliation>
		<affiliation>UNICAMP</affiliation>
		<editor>Paiva, Afonso,</editor>
		<editor>Menotti, David,</editor>
		<editor>Baranoski, Gladimir V. G.,</editor>
		<editor>Proença, Hugo Pedro,</editor>
		<editor>Junior, Antonio Lopes Apolinario,</editor>
		<editor>Papa, João Paulo,</editor>
		<editor>Pagliosa, Paulo,</editor>
		<editor>dos Santos, Thiago Oliveira,</editor>
		<editor>e Sá, Asla Medeiros,</editor>
		<editor>da Silveira, Thiago Lopes Trugillo,</editor>
		<editor>Brazil, Emilio Vital,</editor>
		<editor>Ponti, Moacir A.,</editor>
		<editor>Fernandes, Leandro A. F.,</editor>
		<editor>Avila, Sandra,</editor>
		<e-mailaddress>helena.maia@ic.unicamp.br</e-mailaddress>
		<conferencename>Conference on Graphics, Patterns and Images, 34 (SIBGRAPI)</conferencename>
		<conferencelocation>Gramado, RS, Brazil (virtual)</conferencelocation>
		<date>18-22 Oct. 2021</date>
		<publisher>Sociedade Brasileira de Computação</publisher>
		<publisheraddress>Porto Alegre</publisheraddress>
		<booktitle>Proceedings</booktitle>
		<tertiarytype>Master's or Doctoral Work</tertiarytype>
		<transferableflag>1</transferableflag>
		<keywords>action recognition, visual rhythm, multi-stream architecture.</keywords>
		<abstract>In this work, we address the problem of human action recognition in videos. We propose and analyze a multi-stream architecture containing image-based networks pre-trained on the large ImageNet. Different image representations are extracted from the videos to feed the streams, in order to provide complementary information for the system. Here, we propose new streams based on visual rhythm that encodes longer-term information when compared to still frames and optical flow. Our main contribution is a stream based on a new variant of the visual rhythm called Learnable Visual Rhythm (LVR) formed by the outputs of a deep network. The features are collected at multiple depths to enable the analysis of different abstraction levels. This strategy significantly outperforms the handcrafted version on the UCF101 and HMDB51 datasets. We also investigate many combinations of the streams to identify the modalities that better complement each other. Experiments conducted on the two datasets show that our multi-stream network achieved competitive results compared to state-of-the-art approaches.</abstract>
		<language>en</language>
		<targetfile>camera_ready.pdf</targetfile>
		<usergroup>helena.maia@ic.unicamp.br</usergroup>
		<visibility>shown</visibility>
		<mirrorrepository>sid.inpe.br/banon/2001/03.30.15.38.24</mirrorrepository>
		<nexthigherunit>8JMKD3MGPEW34M/45PQ3RS</nexthigherunit>
		<citingitemlist>sid.inpe.br/sibgrapi/2021/11.12.11.46 4</citingitemlist>
		<hostcollection>sid.inpe.br/banon/2001/03.30.15.38</hostcollection>
		<agreement>agreement.html .htaccess .htaccess2</agreement>
		<lasthostcollection>sid.inpe.br/banon/2001/03.30.15.38</lasthostcollection>
		<url>http://sibgrapi.sid.inpe.br/rep-/sid.inpe.br/sibgrapi/2021/09.06.18.15</url>
	</metadata>
</metadatalist>